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Abstract. Let A be a subset of positive relative upper density of P d , the d-tuples of primes. We 
prove that A contains an affine copy of any set e C Z d , as long as e is in general position in the 
sense that the set e U {0} has at most one point on every coordinate hyperplane. 



1. Introduction. 

1.1. Background. The celebrated theorem of Green and Tao [I] states that subsets of positive 
relative upper density of the primes contain an affine copy of any finite set of the integers, in 
particular contain arbitrary long arithmetic progressions. It is natural to ask if similar results 
hold in the multi-dimensional settings, especially in light of the multi-dimensional extensions of the 
closely related theorem of Szemeredi [8] on arithmetic progressions in dense subsets of the integers. 
Indeed such a result was obtained by Tao [S], showing that the Gaussian primes contain arbitrary 
constellations. In the same paper the problem of finding constellations in dense subsets of F d was 
raised and briefly discussed. 

The difficulty in this settings comes from two facts. First, the natural major ant of the (i-tuples of 
primes is not pseudo-random with respect to the box norms, which replace the Gowers' uniformity 
norms in the multi-dimensional case. This may be circumvented by assuming the set e is in general 
position as described below, as is already suggested in [9]. However even under the this non- 
degeneracy assumption, the so-called correlation conditions in [3] do not seem to be sufficient, and 
a key observation of this note is to use more general correlation conditions to obtain the dual 
function estimates in the multi-dimensional case. Also, we need to use an abstract transference 
principle due to Gowers [3] and independently to Reingold, Trevisan, Tulsiani and Vadhan [7], see 
also Tao and Ziegler |10j . 



1.2. Main Results. Let e = {ei, . . . , e;} € be a set of vectors; a constellation defined by e 

is then a set e' = {x, x + te\, . . . ,x + tei} where t ^ is a scalar, that is and affine image of the set 
eU{0}. 

Definition 1.1. We say that a set of I vectors e € is in general position, if |7Tj(eU{0})| = 1 + 1 
for each i, where 7Tj is the orthogonal projection to the i th coordinate axis. 

Let us also recall that a subset A of the (i-tuples of primes F d is of positive upper relative density 

if 

\An[i,N] d \ 

Inn sup ——; — > 

Our main result is then the following 

Theorem 1.1. Given any set A <^P d of positive relative upper density, we have that A contains 
infinitely many constellations defined by a set of vectors e S in general position. 

Remarks: We note that for d = 1 this translates back the above described theorem of Green and 
Tao [I], as any finite subset of Z is in general position. 
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Also, one may assume that I = d and the set e = {e±, . . . ,e d } C Z rf forms a basis in IR rf 
besides being in general position, by passing to higher dimensions. Indeed, if e € (Z rf )' then let 
{/lj ■ • ■ , /i} C Z' be linearly independent vectors, and define a basis 

e' = {e[ = (ei, /i), . . . , ej = (e/, fi), e' l+1 , . . . , e' l+d } C Z d+Z by extending the linearly independent 
set of vectors e\ = (ei,fi), (1 < i < I). If e was in general position then it is easy to make the 
construction so that e' is also in general position, and if the set A' := A x F l contains a constellation 
x' + te', then j4 contains x + te. Thus from now on we will always assume that e is also a basis of 
R d . 

Theorem 11.11 may be viewed as a relative version of the so-called Multidimensional Szemeredi 
Theorem |1], stating that any subset of Z d of positive upper density contains infinitely many 
constellations defined by any finite set of vectors e C Z rf . As is customary, we will work in the 
"finitary" settings, when the underlying space is the group Z% = (Z/iVZ) d , N being a large prime. 
In this settings we need the following, more quantitative version: 

Theorem A (Furstenberg-Katznelson [1]). Let a > ; d G N and let e = {ei, . . . , e^} C 7L d N be a 
fixed set of vectors. If f : 7L d N — > [0, 1] is a given function such that K(f(x) : x £ Z^) > a, then 
one has 

E{f{x)f(x + te x ) ...f{x + te d ) : x G Z%, t G Z N ) > c{a, e) (1.1) 
where c(a, e) > is a constant depending only on a and the set e. 

Here we used the "expectation" notation: K(f(x) : i£i) = m X]zeA 

In the relative settings, when A C F d , the condition: E(/(x) : x G Z^) > a (after identifying 
[l,iV] rf with Z^-) does not hold for the indicator function / = 1a, however it holds for / = l^A^ 
where A d is the <i-fold tensor product of the von Mangoldt function A. The price one pays is that 
the function / is no longer bounded uniformly in N. Following the strategy of [3] we will show that 
the d-iold tensor product ® d v of the pseudo-random measure v used in [1] is sufficiently random in 
our settings in order to apply the transference principle of [3]; we will refer to such measures v as 
d-pseudo-random measures. We postpone the definition of d-pseudo-random measures to the next 
section, but state our main result in the finitary settings below: 

Theorem 1.2. Let a > be given, and d be fixed. There exists a constant c(a, e) > such that the 
following holds. If < / < [i is a given function on 7L d N such that [i = ® d v where v is d-pseudo- 
random, and E(/(x) : x G Z^) > a, then for any basis e = {e±, ...,e^} in general position, we have 
that 

E(f(x)f(x + t ei )...f(x + te d ) : x G Z%, t G Z N ) > c(a, e) (1.2) 

1.3. Norms, Transference, and Pseudo-random Measures. First we introduce the cf-dimensional 
box norms. We actually introduce one norm for each linearly independent set of vectors 
{e u ...,e d }CZ d N . 

For a function / : Z^ — > C this norm with respect to a basis e is given by 

Wf\\n(ey = E ( II f(x + ojte):xeZ d N , teZ d N ) 
u)e{o,i} d 

with the notation cote = witiei + ... + oj^d^d- 

That this is actually norm is not immediate, but for the standard basis it can be shown by 
repeated applications of the Cauchy-Schwarz inequality, similarly as for the Gowers norms (see for 
example [2]). For a different basis, note that we have ||/||n( e ) d = 11/ ° T\\ a d for an appropriate 
linear transformation T, where ||/|[rjd is the norm with respect to the standard basis. The same 
way one shows [2] that the analogue of the so-called Gowers-Cauchy-Schwarz inequality holds 
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Proposition 1.1. ( D d (e)-Cauchy-Schwarz inequality) 

Given 2 d functions, indexed by elements of {0,l} d , we have 



(f u : ui € {0, \} d ) = E( ]"[ / u ( I + W fe): 3 :eZ^G4)< [] ll/«lb(e)< 




Gowers presents an alternative approach to the Green- Tao Transference Theorem from a more 
functional analytic point of view, making use of the Hahn-Banach Theorem. The specific version 
he provides will be presented below after we recall some definitions. First we note that || • ||* is the 
defined to be the dual norm of 1 1 • 1 1 . 

Definition 1.2. Let \\ • \ \ be a norm on % = L 2 (Z„) such that < ||/||*, and let X C % 

be bounded. Then \\ ■ \\ is a quasi algebra predual (QAP) norm with respect to X if there exists 
an operator V : % — > %, a positive function c onR and an increasing positive function C on M. 
satisfying: 



This definition in enough to state the transference principle. 

Theorem B. ( Gowers [3] j Let \x and uj be non-negative functions on Y, Y finite, with \ \^\\l 1 A\ U} \\l 1 — 
1, and rj, 5 > be given parameters. Also let \ \ ■ \ \ be a QAP norm with respect to X , the set of all 
functions bounded above by max{f(,w} in absolute value. There exists an e such that the following 
holds: If we have that \ \fj, — u\\ < e, then for every function with < / < fi there exists a function 
g with < g < w/(l — 5) and \ \f — g\\ < rj. 

Remarks: By a simple re-scaling of the norms the constants 1 in Definition 11.21 and Theorem B 
can be replaced by any other fixed constants. The actual form given by Gowers is more explicit, 
in fact giving a specific choice of e. However, for our purposes, we only need such an e that is 
independent of the size of Y. Also, for our purpose one may choose u = 1 and 5 = 1/2. 

The definition of a pseudo-random measure in this paper will be slightly stronger than that of 
Green and Tao, adapted to the higher dimensional settings. Let us begin with the one dimensional 
case. Following we define a measure to be a function v : 7Ln — > M. to be a non-negative function 
such that 



where the o(l) notation means a quantity which tends to as iV ^ oo. A measure will be deemed 
pseudo-random if it satisfies two properties at a specific level. The first of these is known as the 
linear forms condition, as we will use only forms with integer coefficients we need a slightly simplified 
version. 

Definition 1.3. (Green-Tao [Ijj Let v be a measure, and mo,io € N be small parameters. Then 
v satisfies the (mo, to) -linear forms condition if the following holds. For m < mo and t < to 
arbitrary, suppose that {-C'i,j}i<i< m ,i<j<t o- re integers, and that bi are arbitrary elements ofZ^. 
Given m linear forms <f>i : 7l} N — > Z^v with 



3=1 

x = (xi,...,xt) and b = (b\,...,bt), if we have that each ipi is nonzero and that they are pairwise 
linearly independent, then 



(i) {f,Vf) <l/or all f eX, 

(ii) (f,Vf) > c(e) for every f G X with \ \f\\ > e, and 
(OA) \\Vh...Vf K \\* < C{K) for any fi,...,f K G X. 



E(u(x) : x € Zjv) = 1 + o(l). 





(1.3) 
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where the o(l) term is independent of the choice of the hi 's. 

The next condition is referred to as the correlation condition. 

Definition 1.4. Let v be a measure. Then v satisfies the (mo, mi) correlation condition if for 
every 1 < m < mo there exists a function r = r m : Z^r — > such that for all fceN 

E(r fe (x) : x G Z N ) = O m , fe (l) 

and also 

(mi m \ m / \ 

i=lj'=l / i=l \l<i<i'<m / 

where the functions (pi : Z r N — >■ Z^r are pairwise independent linear forms. 
Remarks: 

This is a stronger condition that what is used in [4], in fact they used the special case when 
mi = 1, and <fi is the identity. We define below a (i-pseudo-random measure to be a measure 
satisfying these conditions at specific levels. 

Definition 1.5. We call a measure v a d-pseudo-random if if v satisfies the 

((d 2 + 2d)2 d ~ 1 , 2d 2 + d) -linear forms condition and the (d,2 d )- correlation condition 

We will deal with d-fold tensor product of measures, /i = <S>f = i^ and call them <i-measures. We 
will call such a d-measure /i to be pseudo-random if the corresponding measure v is (i-pseudo- 
random. Finally, note that for a d-measure 



E(/i(x) :x£Z d N ) = f[E(v(xi) : Xi G Z N ) = 1 + o(l). 



i=l 

1.4. Outline of the Paper. In Sections 2-3 we prove two key propositions, the so-called gener- 
alized von Neumann inequality and the dual function estimate. The first roughly says that the 
number of constellations defined by a set e is controlled by the appropriate box norm. The second 
is the essential step in showing that the box norms are QAP norms. 

In Section 4, we prove our main results assuming that the measure exhibited in 0] is also d- 
pseudo-random in the sense defined above. First we show Theorem 11.21 which follows then easily 
from the Transference Principle, that is from Theorem B. Next, we prove Theorem ll.ll bv a standard 
argument passing from Zjy to Z. 

Finally, in an Appendix, we prove d-pseudo-randomness of the measure v used by Green and 
Tao, slightly modifying their arguments of Sec. 10 in [4 J based on earlier work of Goldston and 
Yildrim [5] [6]. 

2. The Generalized von Neumann inequality. 

Let e = {ei, . . . , e^} C Z d N be a base of Z d N which is also in general position, which in this settings 
means that |7Tj(e U {0})| = d + 1 for each i where 7r, : Z^ — > Zjv is the orthogonal projection to the 
z-th coordinate axis. 

Proposition 2.1. (Generalized von Neumann Inequality) 

Let w = otimes d u be a pseudo-random d-measure. Given a function < / < w, we have that 

Af :=E(f(x)f(x + te 1 )...f(x + te d ) : x G Z d N ,t G Z N ) = 0(\\f\\ n{e , )d ) (2.1) 

where e' = {e d ,e d - e t , ...,e d - e d -i}. 
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Proof. We shall apply the Cauchy-Schwartz inequality several times. Begin by writing 

d 

Af = A = E(/(x) 11 f(x + he t ) : x G Z%, t\ G Z N ). 
i=i 

Push through the summation on t\ and split the / to write this as 

d 

JL{^W)n^W)Y[f{x + he i ):t 1 e'l N )-.xe'l d N ). 
i=i 

Applying Cauchy-Schwartz to get 

d d 

A 2 <E{w{x)]\f{x + t 1 e i )]\f{x + t 1 e j + t 2 e 3 ) :t u t 2 £Z N ,xeZ d N ), 

i=l j=l 

where we have made the substitution t 2 i— > ti + £2 for the new variable. Note that there should 
be a E(io(x)) = 1 + o(l) multiplier, following from the fact that / < w and from the linear forms 
condition, but for convenience we suppress it and will continue to do so (this is a big O result, so 
this is not of any consequence). We make one further substitution, x i-> x — t±ei, yielding 

d 

A 2 < E(w(x-iiei) Yl EI fix+he^+ojt^ei) [J f(xWt^e 1 ):t 1 ,t 2 eZ N ,xeZ d N ), 

i=2wg{0,l} w'e{0,l} 

where we have introduced the notations e>p = ei — ej, and iW = {t 1+J }* =1 . Note that the final 
product of this expression is independent of t\. 

We now repeat this procedure exactly, pushing through the t\ sum and splitting the terms 
independent of ti, followed by a change of variables. After / applications of Cauchy-Schwarz 
inequality, we claim to have 

d 

A 2 ' <E(W,(x, t,+i) J] J] fix + hef +ojt^e i . l )) 

i=l+l o;G{0,l} ( 

x JJ /(x + w'*We, i ,_i):ti,...,*, + i€Z JV ,x€Z^), (2.2) 
w'e{o,i}' 

for an appropriate weight function Wj which is a product of iu's, evaluated on linear forms which 
are pairwise linearly independent. 

The notations introduced here are e^i = {ej, ^} (note that Z > 1), and ujt^e^i = 

uj\t 2 ei +u 2 he~P + ... + uJiU +1 ef X) . 

To check this form, using induction, apply the Cauchy-Schwarz inequality one more time with 
the new variable t\ + ti +2 to get 

A 2 ' +1 < E{W t (x, h, t l+1 )Wi(x, h + t l+2 , t l+1 ) 

d 

x [J II f( x + *i e ? ) + w * (0e i;i)/(* + tief + *z +2 ef ) + ^e^) 
i=l+i uje{o,i} 1 

x Yl wix + co't^e^) :t 1 ,...,t l+2 eZ N ,x eZ%). 
uj'e{o,i} 1 

Write 

W/ + i(Mi,---,*H-2) = Wi(x,t 1: ...,ti+i)Wi(x,ti +ti +2 ,...,ti +1 ) Y\ w ( x + w'i^ew-i). 

w'e{o,i}' 
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(2.3) 

We now apply the substitution x i-> x — hef^, note that ef^ — ef^ = ef +1 \ and set 

W l+1 {x,t 1 ,...,t l+2 ) = W( +1 {x-t 1 efl 1 ,t 1 ,...,t l+2 ), (2.4) 
This gives 

d _ 

A 2 ' +1 <E(W l+1 (x,h,...,t l+2 )x J] [] /(* + t ie ? +1) W m) e J;i+ i) 

i=l+2 u e{0,l} l + 1 

x Yl f{x + co't^e l+1 .i):t 1 ,...,ti +2 eZ N ,xeZ d N ). 
w'e{o,i} i+1 

and this is the form we wanted to obtain. 

After d — 1 iterations, one arrives at the form 

A 2 "' 1 < E(W d _i(x, ti, t d ) JJ /(x + w't^-^e^d-i) :h,, ...,t d eZ N ,xe 1%). 

w'G{0,l} d 

This may be written as 

A 2d_1 <E( 11 f(x + u't^e d , d ^) :t 2 ,...,t d eZ N ,x eZ d N )+E, 
w'e{o,i} d 

where 

E = E((W d _i(x, ti, i d ) - 1) [J /(x + u/t^e^d-i) : ti, t d G Zjy, x G Z&). 

w'e{o,i} d 

To see that the main term is in fact an appropriate box norm, notice that 

e d ;d-i = &d, e d -ei, e d - e d _i} 

is also in general position. 

To deal with the error term E, we apply the Cauchy-Schwarz inequality one more time to get 

E < E((W(x, t 2 , t d ) - l) 2 Yl w ( x + ^t (d) e d] d-i) ■ h, , ...,t d+1 eZ N ,x£ Z%), 

u/e{o,i} d 

where we have set 

W(x,t 2 ,...,t d ) =E(W d - 1 (x,t 1 ,t 2 ,...,t d ) :h eZ N ) 

and again used the fact that / < w. Now to show that E = o(l), it is enough to show that the 
linear forms defining W are pairwise independent, after of course expanding (W — l) 2 and applying 
the linear forms condition. By following the construction of W, this amounts to showing that at 
each step W\ satisfies pairwise independence, which itself reduces to showing that the coefficient of 
a: is 1 in each form and each form has a nonzero coefficient in t\ (in each coordinate). 
To be more precise, the case I = 1 is immediate. Assuming this is so for I fixed, then 

Wl +1 (x,h,...,t l+2 ) = Wi(x,t u ...,ti +1 )Wi(x,ti+ti + 2,...,ti +1 ) Yl w( x + u't {l) em_i). 

uj'e{o,i}' 

certainly satisfies this, as the the forms in Wi(x, ti, tj+i) and Wi(x, t\ +U +2 , £j+i) are pairwise 
independent because the t\ coefficient is non-zero, and n w 'e{o i} ; w ( x + tt,/ ^^ e J;J-i) * s independent 
of t±. The statement about the coefficient of x is obvious. Also, it not hard to see that the vector 
multiple of t\ is either e; + i or (for forms appearing after i applications of Cauchy-Schwarz). 
Thus the statement is true for I + 1. 

The fact that E = o(l) then follows directly from the (d(d + 2)2 d - 1 ,d(2d + 1)) linear forms 
condition. □ 
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3. The dual function estimate. 

As before we assume that a basis e = {e±, e^} C Z d N is given which is in general position. We 
will use the notation uye = u)\y\e\ + ... + uj^y^e^, for uj G {0, l} d and y G Z^-. First we define the 
dual of a function / : Z d N — )■ R with respect to the || ||n( e )d norm. 

Definition 3.1. . Let f : Zj^ -> R be a given function and let e = {e\, e^} C Z-y 6e a frasis o/ 
Z^r . XTie efa/aZ of the the function f is the function 

Vf{x) = E ( Yl f( x + ute ) (3-1) 

wG{0,l} d ,w^0 

Proposition 3.1. With X and V as above, and e in general position, we have 

\\Vf 1 ...Vf K \\* n(e)d <C(K) 
for any /i, f K G X. 
Proof. We must show that 

(f\Vf 1 ...Vf K )<C K \\f\\ u{e)d 
by the definition of the dual norm. By applying the definition of Vf, the LHS gives 

K 

(f,Vf 1 ...Vf K )=E(f(x)JlE( 11 f i (x + ujt i e):t i eZ d N ):xeZ d N ). 

»=i we{o,i} d ,^o 

Expanding out the products then gives the RHS as 

K 

E(E(/(x) [] ]Jf l (x + ujt i e + ujte):x,teZ d N ):T=(t\...,t K )e(Z d N ) K ) 

wG{0,l} d ,w^0«=l 

after a substitution t l i— >• i + t l for each i for some fixed t, and adding a redundant summation 
in t. Now we call i ? ( tJi T)(a ; ) = TiiLi + wt*e) for non-zero uj, and F^d^^x) = f(x). The last 
expression then becomes 

H(F(u,,T) :uje{0,l} d ) :TeZ d N ). 

By applying the □(e)-Cauchy-Schwarz inequality, we have arrived at 

\\T>f 1 ...Vf K \\* n{e)d < E( H \\F^ T) \\ n{e)d : T G Z%). 

An application of the Holder inequality gives that the RHS is bounded above by 
J] E(\\F {w , T) \\* e)d :Te(Z d N ) K ), 

ui€{0,l} d ,iv^0 d 

where we added one factor of the constant 1 function, which has L 9 -norm one for each q. Thus, we 
now just need to show that for a fixed uj / d we have 

E(\\F { ^ T) \\* e)d :Te(Z d N ) K ) = 0(K) 

for T= (t\...,t K ). 

We continue by expanding the last expression for a fixed co ^ d , 

K 

ll^WIn^ : T G (Z d N ) K ) = O(K) = E( [] [J/^xWeWie) : x,t,t\...,t K G Z%). 

ui'e{o,i} d »=i 
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The RHS factorizes as 

K 

E(JjE( \\ frix + ojye + oo'te) : y e Z d N ) : x,t 

»=1 o/€{0,l} d 

Applying the bound / < u gives 

E(E K ( Yl v(x + ujye + Lj'te):y£Z d N ):x,t£Z d N ). 
w'e{o,i} d 

The inner sum is now split component wise 

d 

[J KMj + ^'te + x)j) :j/eZl), 
j=We{o,i} d 

where the notation (x)j denotes the j th coordinate. The terms (uiye)j represent the linear forms 
Ylt=i UJ sUs{e s )j, which satisfy the hypothesis in the (d, 2 d ) correlation condition by the assumptions 
on e. Hence we have 

d d 

E(Q II K^ye)j + (cj'te + x) 3 ) :yeZ d N )<Y[ £ r{({J - u/')te)j), 

J = lw'G{0,l} d j = lU>'^U>" 

as the (x)j terms drop out in the subtraction. 
Plugging this bound back in gives 



yd \ 

J N) 



E t({(J -J')te) 3 )) K 

Making use of the triangle inequality in C dK , after another application of Holder, reduces our task 
to bounding 

[] £ E(r tf ((( W '- W "H-):teZl). 

By the assumptions on e and the fact that uj' — to" ^ d , ((uj' — uj")te)j provides a uniform cover of 
Ztv, and we may reduce this to 

E(r dK (t) : t e Z N ). 

This expression is Ox(l)- D 

4. Proof of the main results. 

In this section we prove our main results under the assumption that the measure exhibited in [3] 
is (i-pseudo-random, i.e. it satisfies Definition 11.51 

4.1. Proof of Theorem Q Let e = {e 1} . . . , e d } C be a basis which is in general position. 
For a function / : 7L d N ->twe define its dual by 

Vf(x)=E( H f(x + ute):t£Z d N ). (4.1) 

wG{0,l} d ,w^0 

Then clearly 

(f,Df) = \\ffa ie)d (4.2) 

Let /u = (g) i/ be a pseudo-random (i-measure, and let X be the set of functions / on 7L d N such 
that \ f\ < ji pointwise. 
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Lemma 4.1. The norm || || n ( e \d is a quasi algebra predual (QAP) norm, with respect to the set X 
and the operator D. 

Proof. We have already shown part (in) of Definition ll.21 which was the content of Proposition [37TJ 
If ||/||£ (e)d < e then 

(f,Df) = \\f\\$ e)d < e? 
and part (ii) follows. Finally, since |/| < fi it follows 

(/,£>/) <IH5 e) d = l + o(l) 

as the linear forms (x + wie)j are pairwise linearly independent (for each j) and u satisfies the linear 
forms condition. □ 

We are in the position to apply the transference principle to decompose a function < / < \i into 
the sum of a bounded function g and a function h which has small contribution to the expression 
in (1.2). 

Proof of Theorem \l ."A Let a > and let < / < [i be function such that E/ > a, where [i is a 
pseudo-random d- measure on Z d N . We apply Theorem B, with Y = Z, d N , 5 = 1/2 and 77 > 0. Note 
that since \x is a measure one has that = E/i = 1 + o(l). Since || ||n( e ) d * s a Q-^-f norm with 

respect to the set X = {/ : Y — > M, |/| < ^x}, it follows that there is an e > such that if 

llM- l|b( e )«» < £ ( 4 - 3 ) 
then there is a decomposition / = g + h such that 

0<<?<2 and \\h\\ n{e)d < v . (4.4) 

Since \i is pseudo-random — l||n( e ) d = thus (4-3) holds for large enough iV. Using this 
decomposition together with Theorem A and Proposition 12.11 one may write 

E(/(x)/(x + fei).../(x + te d ) : x G i G Z N ) = 

= E{g(x)g{x+t ei )...g(x+te d ) : x G Z d N , t G Z N ) + 0(||/i|| n(e)d ) > d{a,e)-C d -q > c'(a,e)/2 
by choosing 77 sufficiently small with respect to a and e. This proves Theorem 11.21 □ 

4.2. Proof of Theorem rETJ Let us identify [1, N] d with Z%. First we show that constellations 
in Z^f defined by e which are contained in a box B C [l,iV] d of size eN, are in fact genuine 
constellations contained in B. We say that e = {ei, . . . , e^} G Z rf2 is primitive if the segment [0, e] 
does not contain any other lattice points other than its endpoints in Z d ' 2 considered as a lattice 
point in Z rf2 . Let us also define the positive quantity r(e) by 

r(e) = inf |m — x|oo where Ixloo = max \xj\ 

m£{0,e},x£{0,e] l<d 2 

m is running through the lattice points Z rf2 other than and e. 

Lemma 4.2. Let < e < r(e). Ze£ ./V be sufficiently large, and let B = I d be a box of size eN 
contained in [1, N] d ~ Z^. if £/tere exist x G Zfj and i G Ztv\{0} such that 16B and x + te C B 
as a subset on Z d N , then there exists a scalar t! 7^ such that x + t'e Q B also as a subset of Z d . 
Moreover if e is primitive (and 1 < t < N) then one may take t! = t or t' = t — N . 

Proof. First, note that one can assume e is primitive as x + te = x + tse' for a fixed primitive 
e' and s G N. By our assumption, there is an x G [l,A r ] d and t G [1,^ — 1] such that x G B 
and x + tej G B + (NZ) d for all 1 < j < d. Thus for each j there exits m,- G Z rf such that 
|iej — A^mjloo < eN and hence | Ae 

— ^|oo — £j where T7i — {777,1 j ... j 777,^ j- G 2^ and A — t/ N . Since 
< A < 1 and e < r(e) this implies that m = or m = e. If m = then |te|oo < £./V and since 
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x G B it follows that x + te CBCZ d . If m = e then Kt-^e^ < eiV thus x + (t-iV)e QB CZ d , 
so i + t'e C B as a subset of This proves the lemma. □ 

Let us briefly recall the pseudo-random measure v defined in Sec. 9 [IJ. Let w = w(N) be 
a sufficiently slowly growing function (choosing w(N) <C log log N is sufficient as in [1]) and let 
W = Yl p < w P be the product of primes up to w. For given b relative prime to W define the modified 
von Mangoldt function by 

A 6 (n) = ( log(Wra + b) if Wn + b is a prime; 

\ otherwise. 

where 4> is the Euler function. Note that by Dirichlet's theorem on the distribution of primes in 
residue classes one has that ^n<Af A&(n) = iV(l + o(l)). Also, if A C P d is of positive relative a 
and if A^ := ® d A b is the d-fold tensor product of A^ the it is easy to see that there exists a b such 
that 

limsupA^ d Y 1 A(x)A d (x) > a/2 (4.6) 

N -*°° xe[l,N] d 

We will fix such b and choose N sufficiently large N for which the expression in (4.6) is at least 
a/2. Let R = N d 12 and recall the Goldston-Yildirim divisor sum [3J, [5] 

A fl (n)= Y K<t)log(R/d) 

d\n,d<R 

\x being the Mobius function. For given small parameters < e\ < £2 < 1 (whose exact values will 
be specified later) recall the Green- Tao measure 

u(n) = { w log R 11 £ i iV S n < e 2 iV, ^ ^ 

I 1 otherwise. 

Note that v(n) > for all n, and also it is easy to see that for N sufficiently large, one has that 

v{n) > d-^- 6 A b (n) (4.8) 

for all E\N < n < E2N. Indeed, this is trivial unless Wn + b is a prime. In that case, since £\N > R, 
A R (Wn + b)=logR = d~ 1 2- d ~ 5 log N and (4.8) follows. 

Proof of Theorem li.il Set [i = ® d iJ, and let 

g(x) := c d A d (x) l A (x) l [eiN , E2N]d (x) (c d = rt"") (4.9) 

Then by (4.8) one has that g{x) < fi(x) for all x G By (4.6) one may choose a sufficiently large 
number N' for which 

(NT d 1a(*)H(x) > a/2 (4.10) 



nd 



x£[l,N 

and a prime N such that 

(1 - — —)N' < e 2 N < N 
If £1 is such that £i/£2 < a /WOd, then by the Prime Number Theorem in arithmetic progressions 

{N')- d Y K b( x ) < a/10 (4.11) 

x£[l,N'] d \[eiN,e 2 N] d 
It follows from (4.10) and (4.11) 

N- d Y 9{x) > c d N~ d Y 1 A{x)A d b {x) > c d e d a/A (4.12) 

x£[l,N'] d x€[£iN,£ 2 N} d 
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Using the identification [1, N] d ~ Z^, one has that E(g(x) : x € Z%) > a' ( with a' = cjjega/4), 
and < < for all x. Thus, save for proving that the measure v is d-pseudo-random, 
Theorem 11,21 implies that 

E(g(x)g(x + te{) . . . g(x + te d ) : x <G Z%, t G Zjv) > c'(a, e) > 0. 

Note that the contribution of trivial constellations, corresponding to t = 0, is at most 0(N^ 1 log d N), 
as \A d \ < log d uniformly on [1, iV] d . Since the support of g is contained in An[e±N, £2N] d , Lemma 
14,21 implies that An [siN, £2N] d must contain genuine constellations of the form {x,x + te\, ... ,x + 
te^} as a subset of Z d . Choosing an infinite sequence of N's it follows that A contains infinitely 
many constellations defined by e. □ 



5. Appendix: The correlation condition. 

To complete the proof of Theorem 11.11 one needs to show that the measure v defined in (4.7) 
satisfies both the linear forms conditions and the (d,2 d ) correlation conditions given in (1.4). Since 
the measure v is the same (apart from the slight change in the interval where v = 1) is the one 
given in [3] (see Definition 9.3, there), the linear forms condition is already established in Prop. 
9.8 in [4j. It turns out that the arguments given in [3] (see Prop. 9.6, Lemma 9.9 and Prop. 9. 10) 
generalize in a straightforward manner to obtain the more general (mo, mi) correlation condition 
for any given specific values of mo and m±. 



Proposition 5.1. For a fixed mo, mi, there exists a function r such that 

Er k = O k (l) 
and also 



mi mo mo 

n\{\{^{Hy) + Kj)-y^^ r N)<\{( E T (Ki-Ki>)) (5-i) 

i=lj=l i=l l<j<j'<mo 

where the (pi : 7L r N — > Z^r are pairwise linearly independent linear forms. 

Let us first note that the arguments of Lemma 9.9 and Prop. 9.10 of [3] applies to our case and 
it is enough to establish the following inequality (see Prop. 9.6 |4j) 

mi mo 

E (Q II A R( W (^(y) + Kj) + b): yeB) 
i=ij=l 

where M = m\mo and B is a box of size at most R 10M . Moreover one can assume that hij ^ hi j> 
for all i, j jtf. 

The next step is, following [3], to write the the expression 

M 



i=i 



where $i = W((pu/ mi \(y) + /i[i/ m iJ. (» (p)) ^ (L x -I * s ^ ne fl° or function, i(mi) is i modulo mi), to 
as a contour integral of the the following form plus a small error 

M 

t; - l < ■ ; 

(5.3) 



(2m)- M I ... I F{z,z')\\?-^dz 3 dz] 
jTi jFi Zj z j 
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where z = (zi, Zm), z' = (z[, z' M ), and function F(z, z') is taking form of an Euler product 

F(z,z') = l\E p (z,,z'), 

p 

where 

E(z ^_ v (-') |I|t|f| "-vu^) 

X,X'C[Af] ^ 

The function oj relates this expression to the particular forms. Specifically 

Lemma 5.1. (Local factor estimate). Set the intervals Ii = [(i — l)mi + l,imi] as a partition of 
[M]. For a € Ii, the homogeneous part of 9 a is Wfa. Also, set Aj = Tlj<j- j j'eh ~ fkj'\.The 
following estimates hold: ux(p)-' 

(1) If P < w(N), then &x(p) = 1 if \X\ = 0, and is otherwise. 

(2) Ifp > w(N) and \X\ = 0, then w x (p) = 1. 

(3) If p > w(N) and X C Jj is nonempty, we have wx(p) = P~ X when \X\ = I, and Wxip) < 
p^ 1 when \X\ > 1. In the latter case, if ' p\ A a , we have that lox{p) = 0. 

(4) If p > w(N) and X D Ij 7^ and X n /j/ 7^ /or some i 7^ i', we Ziaue u)x(p) < ■ 

Proof. When p < w(N), then W0j + 6 = o(p), giving the first result. The second statement is 
trivial. 

For the third statement, let us start with X C Ij with [X| = 1. Then we have 

E ( 1 ^i(l/)+Ai lJ 0+6sQ(p) : V G z tv) = 7>~ 
for any fixed j, proving the first part. The second part requires an estimate of 

^( 1 W(My)+^,j)+b=0(p) 1 W(My)+K, 3 ')+b=0(p) V G 

with j 7^ j' . If p\ \h a j — h a j'\, then the we are left with simply a single equation (p\ W), and may 
refer to the first part. When p\ A a , ooxip) = as h^j is not congruent to hij' t modulo p. 
For the last statement, we have the upper bound 

^{^W(^{y)+h^ j)+b=0 (p) 1 W(^(y)+/ ll , tj >)+b=0 (p) : V & 

for some % 7^ i' and j,f- The forms fa and far are linearly independent modulo p (see the proof of 
Lemma 10.1 in [1]), hence we have the intersection of two distinct linear algebraic sets, which has 
size at most p r ~ 2 . □ 

The terms E p in the Euler product can be separated as 

At 

E p (z,z') = 1 - l p>w{N) ^(p- 1 ^ +p- 1 - z > -p- 1 -^) 

3=1 

+ Y^ 1 P>w(N);p\A l ^p\z,z') + ^ ' 

i=1 X\JX'p a ,ae[mi];\X\JX'\>l 

where 



40 M= E r Mi £ ] > 

xux'ch-Axux'^i P 
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We define the terms 

mi 

Ej® = 1 + ^ lp>w{N);p\Ai^pH z i z '), 
i=l 

and factorize E p = EpEpEpEp as follows: 



EpUjLli 1 - lp>w(N)P 1 2j )( 1 - lp>w(N)P 1 *0Q- - \>w{N)P 1 Zj Zj ) 1 
M 

M 



4 3) = n^-^oa 

3=1 

and set Gi = \\ Ep , noting that 



3 y i c(i+^-)C(i+^)- 



The the following is the analogue of lemma 10.6 in [3]. To state it, Let us recall the domain T>ff 
to be the set 

{Zj,z% : Sfcs,-, SfcsJ- e (-CT, 100) , 1 < i < M}. 

We also have the norms on for / analytic on V^f , denoted ||/||cfcmAf), given by 

ii/iic> (W = -pii(^r-(^r'(| [ r...(4 ; r/ii £ - OT , 

where the supremum is taken over all a\, om, whose sum is at most k. 

Lemma 5.2. Let < a = 1/(6M). Then the Euler products Gi are absolutely convergent for 
i = 0,1,2 in the domain T>ff , and hence represent analytic functions on this domain. We also have 
the estimates 

\\Go\\cr i vM) = OM(log(R)/\oglog(R)) r J] {l + M (p 2Ma - 1 )) 

Pins a, 

\\Go\\cM iV M (i M ) < exp(0 M (log 1 / 3 (i?))) 
\\ G i\\c M (vf /6M ) ^ °m(1) 

\\G2\\ e M(pM m ) < MtW ^(l) 
mi 

Go(o,o)=nn( i +°^^ i/2 )) 

i=lp\Ai 

Gi(0,0) = l + o M (l) 

G 2 (0,0) = WW) M 
where the first bound is for all < r < M . 
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Proof. The estimates proceed exatctly as in Lemma 10.3 and Lemma 10.6 in [3] with A = YYl^i A»> 
barring the statement about Go(0,0). To see this, we have 

mi mi 

g (o, o) = n 4°) = na + e a ? } (°' °))<nii( i+ i a ?(°> °)d 

p[A p|A i=l i=lp|Aj 

and we crudely have |A?(0,0)| = 1 + A /(p~ 1/2 )- □ 
The expression in (5.3) takes the form 



(2m)' M 



with 67 = GqG\G2- To estimate it let us recall the following general result on contour integration 
from [4], see Lemma 10.4 there. 

Lemma 5.3. (Goldston-Yildirim [I] [6],) Let R be a positive number. IfG(z,z') is analytic in the 
2M variables on T>^ for some a > 0, and we have the estimate 

\\G\\ cH vm) = exp(0 M , ( x(log 1/3 ( J R))), 



then 



" c(i + 2j + 4)-R I,+: 

{■> 

M 

= G(0 : ...,0)log M (R) + Y,O M A\\G\\eHvM ) )log M -^R) + O M , a (eM-^ 

3=1 

for some S > 0. 

Estimate (5.2) follows easily applying Lemma 5 (with a = 1/6M) to G = GqG\G2 using Lemma 
4, which in turn implies Proposition 15.11 where the function r is defined precisely as in [4j. This 
finishes the proof of Theorem 11.11 



(2^r A/ 1 - I g(z,z')T\ ; ( l+z i +z i ] R3 3 dz dz > 
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